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ABSTRACT 

We present an application of the fast Independent Component Analysis FastICA 
method to the COBE-DMR 4yr data. Although the signal-to-noise ratio in the 
COBE-DMR data is typically ~ 1, the approach is able to extract the CMB signal 
with high confidence when working at high galactic latitudes. However, the foreground 
emission components have too low a S/N ratio to be reconstructed by this method 
(moreover, the number of components which can be reconstructed is directly limited 
by the number of input channels). 

The reconstructed CMB map shows the expected frequency scaling of the CMB. 
We fit the resulting CMB component for the rms quadrupole normalisation Q rm s-ps 
and primordial spectral index n and find results in excellent agreement with those 
derived from the minimum-noise combination of the 90 and 53 GHz DMR channels 
without galactic emission correction. 

We extend the analysis by including additional channels (priors) such as the 
Haslam map of radio emission at 408 MHz and the DIRBE 140/^m map of galac- 
tic infra-red emission. Subsequently, the FastICA algorithm is able to both detect 
galactic foreground emission and separate it from the dominant CMB signal. Fitting 
the resulting CMB component for Q rm s-ps and n we find good agreement with the 
results from Gorski et al. (1996) in which the galactic emission has been taken into 
account by subtracting that part of the DMR signal observed to be correlated with 
these galactic template maps. FastICA is therefore able to extract foreground emis- 
sion from the DMR data and recover a "clean" CMB component. 

We further investigate the ability of FastICA to evaluate the extent of foreground 
contamination in the COBE-DMR data. We include an all-sky Ha survey (Dickinson, 
Davies & Davis 2003) to determine a reliable free- free template of the diffuse interstel- 
lar medium which we use in conjunction with the previously described synchrotron 
and dust templates. The derived frequency scalings of the recovered foregrounds is 
consistent with previous correlation studies (Banday et al. 2003). In particular we 
find that, after subtraction of the thermal dust emission predicted by the Finkbeiner, 
Davis & Schlegel (1999) model 7, this component is the dominant foreground emission 
at 31.5 GHz. This indicates the presence of an anomalous dust correlated component 
which is well fitted by a power law spectral shape with (3 ~ 2.5 in agreement with 
Banday et al. (2003). 

Key words: methods - data analysis techniques: image processing - cosmic mi- 
crowave background. 
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1 INTRODUCTION 

Current and future CMB space missions such as W MAP 
(launched in June 2001, Bennett et al. 1996) and Planck 
(scheduled for launch in February 2007, Tauber 2000) will 
map the microwave sky emission over the entire sky with an 
unprecedented combination of sensitivity and angular reso- 
lution. In the meantime a large variety of ground-based and 
balloon-borne experiments will provide accurate data over 
various regions of the sky. 

The main goal of such experiments is to produce a 
map cleaned of any contributions from other sources be- 
tween us and the Last Scattering Surface and thus contain- 
ing only genuine CMB anisotropy. From such maps the an- 
gular power spectrum is evaluated and used to determine 
cosmological parameters with high accuracy and thereby 
probe various scenarios for structure formation. Potential 
foreground contaminants include emission from our Galaxy 
(mainly synchrotron, free- free and dust emission), compact 
galactic and extra-galactic sources, and the thermal and 
kinematic Sunyaev-Zel'dovich effect from clusters of galax- 
ies. In order to exploit the cosmological information encoded 
into the CMB angular power spectrum, it is crucial to be 
able to identify and remove such signal components with 
high accuracy and reliability. 

Much recent work has been performed in this area by 
several authors (de Oliveira-Costa & Tegmark 1999, Tenorio 
et al.1999, Hobson et al. 1998, 1999, Stolyarov et al. 2001, 
Prunet et al. 2001) employing a range of techniques from 
the classical Wiener Filter to maximum entropy methods. 
These algorithms are referred to as "non-blind" in the sense 
that they require some a-priori information on the signal 
to be separated (e.g. spatial templates and frequency de- 
pendences of the underlying components, although see Bar- 
reiro et al. 2003 for a variation on this approach using a 
maximum entropy based technique). Recently a blind ap- 
proach has been proposed (Baccigalupi et al. 2001, Maino 
et al. 2002) and applied to simulated sky maps similar to 
those that Planck will produce. The approach appears to 
be very promising in that it is fast and does not need priors 
about the underlying signals (at least for high sensitivity 
missions such as WMAP or Planck). It is therefore pos- 
sible to use such blind algorithms to obtain priors that can 
be fed into classical Bayesian separation algorithms. 

However, to-date these blind algorithms have not been 
applied to real CMB data to assess and validate their reli- 
ability in a real world situation. We present here the first 
application of FastICA to real CMB measurements from 
the COBE-DMR 4yr data. 

This paper is organised as follows: in Section|5|we briefly 
present the component separation problem and the main 
simplifications required for the application of FastICA. In 
Section O and we summarise the basics of the Fas- 
tICA algorithm and its application to COBE-DMR data. 
Section |3] shows the impact of the noise amplitude and dis- 
tribution on FastICA outputs which drives the choice of 
optimal separation technique. The main CMB-related re- 
sults are presented in Section 0] while Section |S] considers 
the implications for foreground emission. A critical discus- 
sion follows in Section |7| 



2 COMPONENT SEPARATION PROBLEM 

Let us suppose that the observed sky radiation is the super- 
position of N different physical processes and its frequency 
and spatial dependences can be factorized into two separated 
terms: 

JV 

x(r,v)=J2sj(r)fj(v). (1) 

This signal is, in general, observed by an experiment 
with M-frequencies through an optical system, whose beam 
pattern is in general modelled at each frequency as a shift- 
invariant point spread function B(r, v). Let us further sup- 
pose that B(r,v) is frequency-independent at least within 
each frequency bandwidth t(y). In addition any real experi- 
ment adds some instrumental noise to the output t v (r). Fol- 
lowing our assumptions the observed signal at a frequency 
v is given by: 

N f 

3=1 

JV 

= B„(r)*5^a„jSj(r) + e„(r) (2) 

3=1 

where * denotes convolution, and 

a vj = J t v (u')fj(v')dv. (3) 

Our data model can be further simplified assuming that 
the radiation pattern of the telescope is frequency indepen- 
dent i.e. B v (r) = Bir). In this case, Eq. (2) can be written 
in vector form as: 

x(r) = As(r) * B(r) + e(r) = As(r) + e(r) , (4) 

where each component, Sj, of the vector s is the correspond- 
ing source function convolved with the B beam pattern. The 
matrix A is the mixing matrix with elements given by the 
a v j coefficients. 

We present this derivation in order to stress data model 
assumptions. It is worth noting that one or more of these 
might be not justified in real cases. For instance both the 
WMAP and Planck experiments will observe the sky ra- 
diation through a telescope with multi-frequency receivers 
and the resulting beam pattern is strongly frequency depen- 
dent. This is up to now one of the main limitation of the 
FastICA approach to astrophysical components separation 
and forced Maino et al. (2002) to further convolve simu- 
lated Planck sky maps in order to obtain similar beam 
functions. Furthermore the noise term e(r) is usually as- 
sumed to be additive, signal-independent, white, Gaussian, 
stationary and uniformly distributed on the sky. These are 
additional strong assumptions since the noise spectrum of 
a real experiment may contain a low-frequency tail due to 
the so-called 1/f noise and different scanning (observing) 
strategies which distribute the integration time on the sky 
in a non-uniform way. 
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2.1 FastICA algorithm 

We briefly summarise here the FastICA algorithm. The 
problem of obtaining both the mixing matrix A and the 
signals s from observed data x is unsolvable if additional in- 
formation is not provided. The ICA approach assumes that 

• the signals s are independent random processes on the 
map domain; 

• all the signal, but at most one, have non-Gaussian dis- 
tribution. 

The strategy we exploit here is described in detail in 
Hyvarinen & Oja (1997) and Hyvarinen (1999) and its ap- 
plication in an astrophysical context can be found in Maino 
ct al. (2002): independent components are extracted max- 
imising a suitable measure of non-Gaussianity. Indeed the 
central limit theorem states that a variable which is a mix- 
ture of independent variables is "more Gaussian" than the 
original ones. Therefore we have to find a transformation 
such that the Gaussianity of the variables is reduced: this 
is equivalent to finding a set of transformed variables that 
are "more independent" than the original ones. Furthermore 
since our data model is a noisy, we have to define a measure 
of non-Gaussianity that is robust against noise, this is the 
the so-called neg-entropy. 

Approximations to neg-entropy have been given by 
Hyvarinen & Oja (2000) and Hyvarinen (1999) and, if the 
noise has the properties assumed previously and its covari- 
ance matrix is known, the Gaussian moments of the trans- 
formed variables y = Wx are shown to be robust estimates 
of the desired functions. Here the matrix W is the separation 
matrix such that the y components are in fact independent. 

The algorithm needs a preprocessing step (Hyvarinen 
1999) in which the input maps are "quasi whitened". This 
reduces the number of unknowns in the problem. Let us 
assume that we know the covariance matrix £ of the instru- 
mental noise; at each frequency, the mean value is removed 
from the data (the offsets of each independent component 
can be recovered at the end of the separation process) and 
their covariance matrix C is evaluated by computing the 
following expectation value: 

C = (xx T > . (5) 

A modified covariance matrix and quasi- whitened data sets 
are respectively given by: 

S = (C-E)- 1/2 £(C-£)- 1/2 , (6) 

x= (C-Er 1/2 x. (7) 

The separation matrix W is estimated row by row i.e. 
one component at a time. Let w be a M-vector such that 
w T x gives one component of the transformed vector y (w T 
is a row of the separation matrix W). In order to find an 
estimation of the transformed vector y that is robust against 
noise, the following iterative algorithm is applied, together 
with a convergence criterion: 

(i) choose an initial vector w 

(ii) update it by means of: 

w now = (xp(w T x)} - (7+ S)(ff'(w T x)) 

where g is a regular non-quadratic function i.e. g(u) = u 3 , 
g(u) — tanh(w) and g(u) = u exp(— u 2 ). 



(iii) normalise w ne w to be a unit vector 

(iv) compare w ncw with the old computed value; if not 
converged come back to (ii); if converged, begin another row. 

This procedure maximises the non-Gaussianity of the 
component w T x. If k rows of the matrix W are found at 
a given time, the k + 1 row is searched for in a sub-space 
orthogonal to the first k rows. To this purpose a orthogo- 
nalisation procedure (e.g. by mean of Gram-Schmidt rule) 
is inserted between point (ii) and (iii) . Once the separation 
matrix W is obtained the underlying components are de- 
rived by using: 

x = W"'y (8) 

This equation allows us to derive the frequency scalings for 
each independent component: the scaling between v and v' 
of the j th component is given by the ratio of W~^/W~,j. 

It is also possible to recover signal-to-noise ratio for the 
reconstructed components. Since the noise covariance £ is 
supposed to be known, noise constrained realization n x for 
each frequency channels can be built. Once we have the sepa- 
ration matrix W, the noise realization in FastICA, outputs 
are given by Wn x . Noise is transformed like signals and an 
estimation of the noise in the reconstructed component is 
achieved. This is quite useful: suppose that we have M fre- 
quency channels but only N < M components have to be 
separated. Using all M channels we can reconstruct M com- 
ponents and the "fictitious" ones will be characterised by a 
signal-to-noise ratio lower than unity. 

2.2 Application to COBE-DMR 4yr data 

The DMR experiment has observed full-sky microwave emis- 
sion at three frequencies (31.5, 53 and 90 GHz). At each fre- 
quency, two separate channels (denoted A and B) measured 
the temperature difference between two horn antennas of 
angular resolution ~ 7° pointing in directions separated by 
60° on the sky. This differencing information was then used 
to reconstruct full-sky maps of the microwave sky. 

The instrumental noise is almost white and Gaussian 
but, due to the scanning strategy, it is not uniformly dis- 
tributed on the sky: variations by a factor of ~ 4 is typically 
observed at 90 and 53 GHz while factor of ~ 6 is found 
at 31.5 GHz. This larger factor reflects the fact that part 
of the original 31 GHz data have been discarded due to a 
well known systematic effect (Kogut et al. 1996). In this 
way the 31.5 GHz channels shows a noise distribution on 
the sky which is substantially different from that at 53 and 
90 GHz. Furthermore the S/N ratio is quite poor: for 10° 
effective resolution smoothed maps this is ~ 0.5, 1.5 and 1 
when combining A and B channels at 31.5, 53 and 90 GHz 
respectively, and becomes ~ 2 when combining together all 
the frequencies. However, no extra-smoothing is required be- 
fore applying FastICA since the angular resolution is the 
same for each channel. 

We have applied the FastICA algorithm to the COBE- 
DMR 4yr data in HEALPix format (Gorski et al. 1999) with 
a resolution parameter Aside = 32 corresponding to 12288 
pixels in the sky with size ~ 1.83°. The A and B radiome- 
ters at each frequency are combined to obtain a sum map 
(A+B)/2. 

Maino et al.(2002) have shown that the optimal CMB 
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reconstruction, both in terms of frequency scaling and off- 
set normalisation, can be achieved by exploiting data at 
high galactic latitudes. We use the extended custom galac- 
tic cut (Banday et al. 1997) to select regions useful for this 
CMB reconstruction. Before applying the code the best fit 
monopole, dipole and quadrupole - computed using only 
those pixels surviving the Galactic cut - are subtracted from 
the maps. FastICA "naturally" removes a monopole from 
the input sky maps but does not do the same for dipole and 
quadrupole distributions. These modes are removed since we 
expect that structures on the largest angular scales are con- 
taminated most significantly by a galactic emission which is 
characterised by a steeply falling power spectrum. Since one 
figure of merit for the separation performed by FastICA is 
the frequency scaling, this procedure minimises the contam- 
ination of the derived CMB scaling by foreground emission. 

We work with all the non-quadratic functions described 
before: g{u) = u 3 , g(u) = tanh(u) and g(u) = u exp(— u 2 ) 
identified by p, t and g, respectively. 

Finally, we make use of the S/N ratio and frequency 
scaling of each reconstructed component to assess its phys- 
ical nature. 



3 IMPACT OF NOISE DISTRIBUTION AND 
AMPLITUDE 

One of the assumptions of FastICA is that the noise is 
Gaussian, stationary and uniformly distributed on the sky. 
The latter is manifestly not true for DMR. In order to vali- 
date the impact of noise amplitude and distribution on the 
reconstruction procedure, we performed ~ 3000 simulations 
of fake CMB skies (characterised by a Harrison-Zeldovich 
initial power law spectrum with an rms quadrupole normal- 
isation of ~ 18/iK) and instrumental noise. Specifically we 
consider three cases: 

(i) DMR nominal noise amplitude and distribution 

(ii) uniform noise distribution (i.e. each pixel is ob- 
served the same number of times) with the noise amplitude 
matched to the mean noise amplitude in the actual DMR 
data (0.322, 0.100 and 0.139 mK at 31.5, 53 and 90 GHz, 
respectively) 

(iii) uniform noise distribution with amplitude reduced 
by a factor of 4 with respect the nominal values 

For each realisation we performed a component sepa- 
ration on the cut sky with the p, g and t functions. We 
have considered the cases where all three frequencies are 
included, or just the 90 and 53 GHz data (since we know 
that the 31.5 GHz channel shows a different noise pattern 
on the sky). In each case we evaluate the frequency scaling 
(between 90 and 53 GHz) of the reconstructed CMB com- 
ponent as well as its angular power spectrum on the cut sky 
exploiting the technique described in Gorski et al. (1994) 
and in Gorski et al. (1996, hereafter G96). 

Table reports the results for the derived frequency 
scaling for the p function (similar results are obtained with 
g and t). The results are almost identical for noise cases ii) 
and iii) with either 2 and 3 DMR frequencies, though nat- 
urally, with higher sensitivity a higher accuracy is achieved 
for the derived CMB frequency scaling. However, the situa- 
tion is quite different for the nominal noise distribution and 



Table 1. Reconstructed CMB frequency scaling between 90 and 
53 GHz for our 3000 MC simulations for three different models 
of the noise distribution as explained in the text. There is a clear 
degradation of the result when the 31.5 GHz channel is included 
in the analysis. 



Combination 


i 


ii 


iii 


31.5:53:90 


0.871±0.477 


0.879±0.188 


0.876±0.019 


53:90 


0.871±0.205 


0.878±0.192 


0.876±0.019 



amplitude (case i). Although the mean value is almost the 
same and consistent with the expected theoretical value, the 
rms is somewhat degraded being 0.477 and 0.205 for 3 and 
2 frequency channels separation, respectively. Therefore a 
first indication of the relevance of the noise distribution is 
derived from an inspection of the frequency scalings. 

A similar situation is found in the analysis of the recon- 
structed CMB component power spectrum as shown in Fig- 
ure The CMB component recovered when all three DMR 
sum maps are inputs to the FastICA algorithm shows an 
angular power spectrum (dot-dashed line) that is suppressed 
with respect to that derived from only 90 and 53 GHz in- 
puts (solid line), although they do agree at the 1-sigma level 
(dark grey and light grey shaded regions for all DMR fre- 
quencies and only 90 and 53 GHz respectively). The reason 
for this behaviour is probably associated with the distinct 
noise distribution of the 31.5 GHz channel relative to the 
other two frequencies, which have considerably more consis- 
tent noise patterns. This drives the FastICA algorithm to 
identify two components: one which has the expected CMB 
frequency scaling and a second with an unphysical (negative) 
frequency scaling (and interestingly a higher S/N ratio) into 
which part of the CMB power has been aliased. 

We conclude that differences in the noise patterns be- 
tween the DMR channels are a potential source of unphysi- 
cal components reconstructed with FastICA, and can have 
a notable effect on the frequency scaling of the reconstructed 
CMB component and its angular power spectrum. Therefore 
in what follows we restrict analysis to the 90 and 53 GHz 
channels only. 



4 RESULTS - CMB 

We have applied FastICA to the 90 and 53 GHz frequency 
channels. The results are almost identical for the three non- 
quadratic functions considered. Table |21 summarises the re- 
constructed CMB frequency scalings and the percentage de- 
viation from the theoretical CMB frequency scaling. Results 
from the p- and ^-functions are consistent in terms of the 
correct frequency scaling and S/N ratio. Results determined 
using the f-non-quadratic function reveal a non-optimal fre- 
quency scaling for the CMB component. Nevertheless the 
correct normalisation could be derived. 

Figure [5] shows the angular power spectrum of the re- 
constructed CMB component for the p-, g- and t-functions 
compared with that computed from the minimum-variance 
noise combination of the 53 and 90 GHz channels. All results 
are consistent although the t-function shows a small excess 
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Figure 1. Reconstructed CMB angular power spectrum from our ~ 3000 Monte Carlo (MC) realisations of fake CMB skies and 
instrumental noise. Left panel shows results from nominal noise case, middle panel is for uniform noise with nominal mean amplitude 
and right panel is for uniform noise with reduced amplitude. Solid lines refer to 53 and 90 GHz combination while dashed lines include 
the 31.5 GHz in the analysis. Light grey shaded area refers to 1-sigma limit for 53 and 90 GHz channels while dark grey is 1-sigma limit 
with also the 31.5 GHz. Those shaded areas nearly fully overlap in all cases but the one with nominal noise. Including the 31.5 GHz 
channel in the analysis leads to power spectra that are suppressed with respect to the case in which it is excluded. Small deviations are 
visible also for the uniform noise and mean nominal amplitude case. 
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Figure 2. Angular power spectra from the 90 and 53 GHz combination of DMR data (default coadded) and from the reconstructed 
CMB ICA components with p-, g- and t-functions. 
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Using the same linear combination coefficients that pro- 
duce the reconstructed CMB components from the input 90 
and 53 GHz maps, it is possible to derive the rms noise 
properties per pixel for the reconstructions. This noise pre- 
scription is used as an input to the likelihood analysis (see 
Gorski et al. 1994 and G96) solving for Q rm s-ps and n. Ta- 



ble reports the results of these fits, whilst Figure |3 plots 
the corresponding likelihood contours. 



The results are consistent for p and g while larger 
Qrms-ps and n values are recovered for t. Nevertheless, 
all are consistent with the fit derived from the optimal 
minimum-variance combination of the 90 and 53 GHz sky 
maps without foreground corrections applied. 
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Figure 3. Likelihood contours in the Q rm s—PS ,n plane for the reconstructed CMB component with FastICA. Also plotted is the 
"classic" result for the minimum noise combination of the 90 and 53 GHz DMR channels. 



Table 2. DMR 90-53 GHz reconstructed CMB frequency scaling 
and S/N ratio. Also reported are the deviations from the theo- 
retical expected CMB frequency scaling. 



ICA form 


Scaling 


A 


S/N 


P 


0.82±0.28 


6.6% 


1.069 


9 


0.91±0.24 


3.0% 


1.065 


t 


0.64±0.23 


30% 


1.072 



Table 3. Q rm s-PS an d n as derived from the CMB compo- 
nent reconstructed with the FastICA algorithm. Both 1- and 2-a 
confidence intervals and the corresponding results from G96 de- 
termined without galactic correction are reported. 



Table 4. Qrms—PS an d n determined from an analysis of the 
CMB component reconstructed by FastICA when additional 
foreground channels (Haslam and DIRBE maps - HD) are in- 
cluded. Both 1- and 2-a limits and the results from G96 after 
galactic correction are reported. 



Map 


Qrms 




n B 




G96+HD 


14 qq+319 


+7.35 
-4.16 


i 97 +0.20 
-0.27 


+0.43 
-0.55 


p+HD 


1 4 rn +2 ' 56 
ii.vo_ 2 56 


+6.39 
-4.16 


, 9 7 +0.25 
-0.27 


+0.47 

-0.57 


9+HD 


14 K7+ 2 - 87 
lt.ui _ 2 .24 


+7.03 
-4.16 


1 29+ - 20 
x - zy -0.27 


+0.46 
-0.54 


f+HD 


11 15+ 2 - 24 


+5.44 
-3.19 


1 34+°- 18 
-L- -0.25 


+0.47 

-0.53 



Map Qrms n B 



G96 


15.63^ 


+7.67 
-4.48 


, 9C+O.22 
± ' ZO -0.25 


+0.45 

-0.53 


P 


15.63t»;» 


+7.35 
-4.48 


i 97+0.25 
± " a ' -0.27 


+0.45 
-0.55 


9 


15.63+2^ 


+7.67 
-4.48 


, 97+O.27 
x - z ' -0.27 


+0.45 
-0.57 


t 


1 fi 27+ 3,51 
1D - z '-2.56 


+7.67 
-4.80 


1 29+ 25 
1 - z3 -0.25 


+0.48 
-0.54 



5 ADDING "PRIORS" ON FOREGROUNDS 

In the previous section we detailed the results of our analy- 
sis of FastICA as applied to the 53 and 90 GHz sky maps 
alone, and stressed the consistency of the cosmological re- 
sults with those determined in G96 without the application 
of any galactic foreground correction. The relevance of this 
comparison can be understood as follows: with only two in- 
put sky maps, the FastICA algorithm can only reconstruct 
two outputs, and we have determined that these correspond 
to CMB together with an unphysical, noise-related compo- 
nent, i.e. the method is not able to reconstruct the galactic 
emission at high galactic latitudes. Indeed, we have verified 
by means of simulations - which include a galactic contribu- 
tion modelled on a DIRBE 140/im template (scaled to the 
DMR frequencies according to the correlation coefficients 
from G96) - that the S/N ratio of the galactic emission in 
the DMR data is too low to allow FastICA to recover it. 
Only when the noise rms is reduced by a factor of 10 rela- 
tive to the actual DMR values can some galactic emission 



be separated by FastICA, although the separation remains 
less than optimal. 

In order to attempt to extract and better separate the 
CMB signal from that foreground emission which is cer- 
tainly present in the data, we have proceeded to add ad- 
ditional channels (priors) using foreground templates such 
as the Haslam map of the diffuse galactic radio emission 
at 408 MHz (Haslam et al. 1982) and the DIRBE map of 
galactic diffuse infra-red emission at 140/im. When applying 
FastICA to the 53 and 90 GHz DMR data in combination 
with the Haslam and DIRBE templates, we have assumed 
that the noise contribution to the template maps is small 
compared to that in the DMR data, which is certainly the 
case. Since we now work with 4 inputs, FastICA returns 
4 derived components. One is again almost consistent with 
noise {S/N lower that 1), one has the CMB frequency scal- 
ing and the expected S/N ratio while the other two show 
features respectively of the Haslam and DIRBE maps. It is 
important to note that FastICA does not completely mix 
the Haslam and DIRBE signals into a single foreground com- 
ponent, but is able to keep them largely separated (see|HJ. 
As before, we fit the CMB component for Q T ms-PS an d n. 
Table 0] reports the results: there is a general agreement be- 
tween our results, especially those determined with the p- 
and g-functions, and those reported in G96 after galactic 
emission has been taken into account using the template 
Haslam and DIRBE maps. 

It is clear that p and g almost give the same results 
while large differences are observed with the t form. This 
may be due to the low S/N ratio of the DMR data: Maino 
et al. (2002) did not observe such behaviour when working 
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with higher fidelity Planck simulations. Inspection of the 
extracted components shows that the frequency scalings of 
the CMB component are almost the same as when only the 
DMR data were used as inputs (variations are 1%) and 

that the extracted signals have also the same S/N ratio. 
We also computed the final rms in the extracted maps and 
compare them with the previous DMR-only case. The rms 
figures are almost the same for p and g (deviations within 
8%) while for the t form we observe a variation in the map 
rms of about 30%. This implies that the t is unable to ad- 
equately separate the underlying components and identify 
them as CMB and galactic signals. 



Table 5. FastICA correlation coefficients between each of the 
DMR frequencies and foreground templates. Results are reported 
for the best calibrated non-quadratic form in each case. Errors 
are derived from simulations. Units are fiK X~ l , where X are 
template units. 



Component 


DMR Map (GHz) 


31.5 


53 


90 


Dust 


5.19±1.55 


2.58±1.09 


1.91±1.14 


Synchrotron 


1.77±0.90 


-0.08±0.67 


-0.35±0.61 


Free-free 


3.19±1.48 


0.86±0.68 


0.98±0.86 



6 RESULTS - FOREGROUND EMISSIONS 

We have analysed the two additional signal components 
extracted by FastICA. These appear to closely resemble 
the Haslam and DIRBE maps, but this is not optimal (the 
Spearman correlation coefficient rs ~ 90%). Furthermore, 
visual inspection reveals that some structure which appears 
in the input Haslam map template appears as "ghosts" (with 
negative signal) in the reconstructed dust map, and vicev- 
ersa as shown in Figure 2] Such behaviour almost certainly 
reflects a more complex spectral signature of the foreground 
components than the method (with only 4 input data sets) is 
able to disentangle. Undoubtedly, the significant large angu- 
lar scale spatial correlation between foregrounds in various 
regions of the sky and lower fidelity of the foregrounds due 
to both the S/N and large angular smoothing of the DMR 
data also play a role. In fact, the extent of the signal aliasing 
depends on the non-quadratic form considered in the anal- 
ysis, and is particularly evident for the t form. The spectral 
indices for these galactic independent components derived 
are not robust to the choice of non-quadratic form and fur- 
thermore are not consistent with what expected from pre- 
vious studies (e.g. G96). However, the CMB component is 
reconstructed well and remains clean from galactic emission 
as results in Table 0] show. 

Interestingly, this means that FastICA can provide in- 
formation on foregrounds, not directly from the (spectral) 
properties of the reconstructed galactic components, but 
rather from information gleaned in the reconstruction of the 
CMB component. Each ICA component is obtained by a 
weighted linear combination of the input data, even though 
the combination coefficients (actually the rows of the matrix 
W) are computed by a non-linear process. This means that 
we can interpret the derived weights effectively as correlation 
coefficients between the foreground templates and the DMR 
data. Indeed, for the previous analysis with the Haslam and 
DIRBE maps, the recovered values are similar to that found 
in G96 using an explicit (minimum \ 2 ) correlation analy- 
sis. Furthermore, the consistency of the results between p, 
g and, to some extent, t, suggest that the method allows a 
robust estimation of the correlation of the DMR data with 
the considered templates. 

Recently Banday et al. (2003) have derived correlation 
coefficients between the DMR data and three foreground 
templates: the Haslam map at 408 MHz, the DIRBE map at 
1 40/xm and a new map of Ha emission (Dickinson, Davies & 
Davis 2003) that could be used to determine a reliable free- 
free template. Furthermore when thermal dust and free-free 



emission are properly accounted for, they found a strong 
anomalous correlation between data at 31.5 GHz and the 
DIRBE map. This component is found to be well fitted by 
a spectral shape of the form v -13 with /3 ~ 2.5. 

We attempt to verify here whether FastICA is able to 
arrive at similar results, and we therefore extend our analysis 
to include the Ha template. An initial consideration is that 
the non-quadratic forms of the FastICA algorithm are in 
general sensitive to different features of the astrophysical 
components, therefore it is possible that a given form may 
or may not be particularly well matched to the detection 
of certain signals on the sky. We therefore calibrate which 
form is best suited to our current studies, by using results 
from Banday et al. (2003) to construct a fake skies with 
CMB, foreground emissions (whose correlation coefficients 
are known a-priori) and ~ 10000 realisations of the DMR 
instrumental noise. In each run a single DMR channel is 
considered together with foreground emission templates and 
each of p, g and t is employed to evaluate the correlation 
coefficients. These simulations also provide an indication of 
the uncertainty in the reconstruction process. Simulations 
show that p and g provide consistent results. 

Table |5] reports our estimates of the correlation coef- 
ficients at each of the DMR frequencies. Figure plots 
our results (filled circles) together with those from Banday 
et al. (2003) (open circles) . There is good general agreement 
between the two sets of results. FastICA also clearly de- 
tects foreground emission correlated with the DIRBE dust 
map at 1 40/im and identifies this as the major foreground 
emission at 31.5 GHz. 

In order to further probe the nature of this anomalous 
dust correlated component, we subtract from the DMR data 
model 7 of thermal dust emission from Finkbeiner, Davis 
and Schlegel (1999, hereafter FDS). This comprises two dust 
components, one at a temperature of 9.6 K with dust emis- 
sivity a = 1.5 and the other at 16.4 K with a = 2.6. The 
results are shown in Table HJ 

From Tableland Fig.^it is clear that, if FDS model 7 
provides an accurate prediction for the thermal dust emis- 
sion at DMR frequencies, then it must account for essen- 
tially the entire dust correlation at 90 GHz. At 31.5 GHz a 
strong anomalous correlation with the DIRBE 140/im dust 
template persists after removal of the thermal dust model, 
whilst the results at 53 GHz seem to indicate the presence of 
both a thermal and anomalous dust correlated component 
which would have roughly equal contributions at a frequency 
of ~ 60 GHz. 

We investigate the spectral behaviour of this anoma- 
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Figure 4. Top panels show the input Haslam 408 MHz (left) and DIRBE 140/^m (right) foreground maps while bottom panels report 
the reconstructed ICA component maps derived with the t-non-quadratic form. The ICA components show large noise and signal ghosts 
from the other component (e.g. the synchrotron North Polar Spur is visible with a negative amplitude in the reconstructed dust map and 
high latitude structures in the DIRBE map are aliased into the reconstructed synchrotron map). Units for Haslam and DIRBE maps are 
K and MJy/sr respectively and are the same for the ICA components (normalized at 90 GHz). 



Table 6. FastICA correlation coefficients between each of the 
DMR frequencies, after thermal dust emission removal following 
model 7 in FDS, and foreground templates. Results are reported 
for the better "calibrated" non-quadratic form in each case. Errors 
are derived from simulations. Units are fiK X -1 , where X are 
template units. 



Component 


DMR Map (GHz) 




31.5 


53 


90 


Dust 


4.74±1.54 


0.70±1.10 


0.32±0.87 


Synchrotron 


1.78±0.87 


0.12±0.65 


0.27±0.57 


Free- free 


3.33±1.56 


1.13±0.72 


0.44±0.87 



Table 7. Power law spectral indices from pairs of frequencies 
and from a fit using data from 31.5 to 90 GHz. We fit for 
A norm (^/i'o) — & where is taken as 31.5 GHz. Also reported 
the 68% confidence level errors. 



031:53 3.67±2.01 
#,3:90 1.48±2.95 



Fitted (3 



2.56 



-1.56 

-0.98 



Fitted A no 



3.90 



+1.54 

0.99 



fact that the ICA method makes no assumptions about the 
cosmological signal present in the data. 



lous dust correlated component by fitting our data points 
(the correlations coefficients with DIRBE 140/im after 
FDS model 7 subtraction) by a power law of the form 
Anorm^/z^o) - ' 3 - Anorm has units of ^K/(MJy sr _1 ) and is 
the amplitude of the emission at the reference frequency vo 
of 31.5 GHz. Table Q shows the results from this analysis: 
it is clear that the frequency behaviour is consistent with a 
spectrum steeper that the typical value of ~ 2.15 expected 
for pure free-free emission (although they are still compati- 
ble with this kind of emission). 

Such conclusions are consistent with those of by Ban- 
day et al. (2003), and are particularly satisfactory due to 
the independence of the two techniques employed, and the 



7 CRITICAL DISCUSSION AND 
CONCLUSION 

In this paper we have applied for the first time the FastICA 
algorithm to real CMB data and specifically to the COBE- 
DMR 4-yr data. 

One of the requirements for applicability of FastICA 
is that noise has to be Gaussian and uniform distributed 
on the sky. Unfortunately, this not the case for the DMR 
observing strategy. Furthermore part of the original data at 
31.5 GHz were removed to eliminate potential contamina- 
tion from a particular systematic effect (Kogut et al. 1996), 
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Figure 5. Derived correlation coefficients between DMR at 31.5, 
53 and 90 GHz and the three foreground emission templates. Dust 
emission is traced by DIRBE map at 140/^m, synchrotron emission 
by Haslam map at 408 MHz and free-free emission by Ho map. 
Filled symbols are results obtained with FastICA while open 
symbols are from Banday et al. (2003) using cross-correlation 
technique. Circles refer to test without subtraction of dust ther- 
mal model while squares are when model 7 of FDS has been 
subtracted from DMR data. 



resulting in a noise distribution that is notably different than 
that at 53 and 90 GHz. Therefore, before applying the tech- 
nique, we have tested the effect of such a non-uniform noise 
distribution on both the frequency scaling and power spec- 
trum reconstruction of the recovered CMB component. We 
run ~ 3000 MC simulations and considered three scenar- 
ios of different noise distribution in the data: one nominal, 
one with uniform noise distribution and nominal mean am- 
plitude and the third with uniform distribution and reduced 
amplitude. Results indicate that the different noise distribu- 
tion of the 31.5 GHz channel results in the identification of 
a spurious non-physical component, and the degradation of 
the frequency scaling of the reconstructed CMB component. 
Furthermore, the angular power spectrum of the latter has 
suppressed amplitude with respect to the case in which only 
the 90 and 53 GHz channels are considered, resulting from 
the aliasing of CMB power into the non-physical component 
associated with the noise asymmetry. 

Further analysis was therefore restricted to include the 
90 and 53 GHz channels only. In this case we obtained 
a CMB reconstruction with a frequency scaling consistent 
with the expected theoretical one. We fit the cosmological 



component with a power-law model parametererised by the 
rms-quadrupole normalisation Q rm s-ps and spectral index 
n, and find that the results are robust against the choice of 
non-quadratic function, completely consistent with those de- 
rived in G96 from the optimal minimum- variance combina- 
tion of the 90 and 53 GHz channels, without foreground cor- 
rections. This is a very important outcome since equivalent 
results have been obtained with independent approaches. 

Simulations with different noise rms properties have 
shown that foreground emission in the DMR data has too 
low a S/N ratio to be detected and separated with Fas- 
tICA. Only with a S/N a factor of 10 or better does 
it become possible to identify foreground emission. How- 
ever, after adding two additional channels - the Haslam 
map or radio emission at 408 MHz and the DIRBE map of 
140/xmgalactic infra-red emission - FastICA is now able to 
recognise foreground emission in the DMR data and separate 
it from CMB signal. The CMB frequency scaling for differ- 
ent FastICA forms are consistent with that found without 
priors on foregrounds. More importantly, we derived values 
for Qrms and n s for this new cleaned CMB reconstruction 
in very good agreement with those reported in G96 after 
foreground correction. 

The situation with the foreground component recon- 
struction is less satisfying, however. We found that the fore- 
ground spectral indices derived from the ICA foreground 
components are not robust against the choice of the Fas- 
tICA non-quadratic form and this could be due to several 
reasons. First of all the different forms tend to preferen- 
tially discriminate between different morphological aspects 
of the foreground emission, with the residuals tending to be 
mixed together. Secondly, the S/N ratio for foregrounds at 
high galactic latitudes is too low for FastICA to obtain 
faithful information on these emissions. Finally, and quite 
realistically, it is likely that the true foreground emission at 
microwave frequencies is different from our priors which are 
based on observations at considerably different wavelengths. 
It is worth noting, however, that Maino et al. (2002) have 
demonstrated that, for higher S/N observations and with 
an exact correspondence between "priors" and foreground 
signals, the algorithm was able to properly extract the un- 
derlying foreground contributions. 

Despite the fact that the spectral behaviour of the de- 
rived foreground components is not properly recovered, our 
results do still allow some information on the non-CMB 
emission to be determined. Since the ICA components are 
linear combinations of the input sky maps, the weightings 
between the various input data used to reconstruct the CMB 
sky can be interpreted as correlation coefficients between the 
DMR data at specific frequencies and the foreground tem- 
plates, in the spirit of the analysis in G96. Indeed, the results 
in Tableware in substantial agreement with this. Following 
the recent work of Banday et al. (2003) we have further con- 
sidered an extra channel using the Ha template provided by 
Dickinson, Davies & Davis (2002). Using these three tem- 
plates, FastICA is able to derive the correlation coefficients 
between the DMR data and foreground templates in very 
good agreement with their results. We found that after sub- 
traction of the dust thermal emission as accounted for by 
model 7 of FDS, a strong dust correlated anomalous compo- 
nent remains, and consitutes the strongest foreground emis- 
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sion at 31.5 GHz. This component is found to be well fitted 
by a power law spectrum with (3 ~ 2.56. 

Blind component separation algorithms such as Fas- 
tICA are promising and can be applied to real data even if 
their assumptions are not completely satisfied. The results 
obtained so far are encouraging and future application to 
CMB datasets (e.g. WMAP and Planck) are foreseen. 
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